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Abstract. Given n points in the plane, the degree- i<* spanning tree problem asks for a spanning 
tree of minimum weight in which the degree of each vertex is at most K . This paper addresses 
the problem of computing low- weight degree- K spanning trees for K > 2. It is shown that for an 
arbitrary collection of n points in the plane, there exists a spanning tree of degree three whose weight 
is at most 1.5 times the weight of a minimum spanning tree. It is shown that there exists a spanning 
tree of degree four whose weight is at most 1.25 times the weight of a minimum spanning tree. These 
results solve open problems posed by Papadimitriou and Vazirani. Moreover, if a minimum spanning 
tree is given as part of the input, the trees can be computed in 0(n) time. 

The results are generalized to points in higher dimensions. It is shown that for any d > 3, an 
arbitrary collection of points in 3i d contains a spanning tree of degree three, whose weight is at most 
5/3 times the weight of a minimum spanning tree. This is the first paper that achieves factors better 
than two for these problems. 
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1. Introduction. Given n points in the plane, how do we find a spanning tree 
of minimum weight among those in which each vertex has degree at most A? Here 
the weight of an edge between two points is defined to be the Euclidean distance 
between them. This problem is referred to as the Euclidean degree- A' spanning 
tree problem and is a generalization of the Hamilton Path problem which is known 
to be NP-hard M, ||. When K = 3, it was shown to be NP-hard by Papadimitriou 
and Vazirani |15|, who conjectured that it is NP-hard for A = 4 as well. When K = 5, 
the problem can be solved in polynomial time [T^] . 

This paper addresses the problem of computing low weight degree- A spanning 
trees for A > 2. In any metric space, it is known that there always exists a spanning 
tree of degree 2 whose cost is at most twice the cost of a minimum spanning tree 
(MST). This is shown by taking an Euler tour of an MST (in which each edge is 
taken twice) and producing a Hamilton tour by short-cutting the Euler tour. In the 
case of general metric spaces, it is easy to generate examples in which the ratio of 
a shortest Hamilton path to the weight of a minimum spanning tree is arbitrarily 
close to two. But such examples do not translate to points in In view of this, 
Papadimitriou and Vazirani fig ] posed the problem of obtaining factors better than 
two for the Euclidean degree-A spanning tree problem. It should be noted that in 
the special case of A = 2, Christofides § gave a simple and elegant polynomial time 
approximation algorithm with an approximation ratio of 1.5 for computing a traveling 
salesperson tour for points satisfying the triangle inequality (points in a metric space) . 

1.1. Our Contributions. In this paper, we show that for an arbitrary collection 
of n points in the plane, there exists a degree-3 spanning tree whose weight is at most 
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1.5 times the weight of a minimum spanning tree. We also show that there exists a 
degree-4 spanning tree whose weight is at most 1.25 times the weight of a minimum 
spanning tree. This solves an open problem posed by Papadimitriou and Vazirani [ fl5| . 

Moreover, if a minimum spanning tree is given as part of the input, the trees can 
be computed in 0(n) time. Note that our bound of 1.5 for the degree-3 spanning 
tree problem is an "absolute" guarantee (based on the weight of an MST) as opposed 
to a "relative" guarantee for the degree-2 spanning tree obtained by Christofides || 
(based on the weight of an optimal solution). 

We also generalize our results to points in higher dimensions. We show that for 
any d > 2, an arbitrary collection of points in 5R d contains a degree-3 spanning tree 
whose weight is at most 5/3 times the weight of a minimum spanning tree. This is 
the first paper that achieves factors better than two for these problems. 

1.2. Significance of Our Results. Many approximation algorithms make use 
of the triangle inequality to obtain approximate solutions to NP-hard problems. These 
algorithms typically involve a "short-cutting" step where the triangle inequality is used 
to bound the cost of the obtained solution. Examples include Christofides' heuristic for 
the traveling salesperson problem ||, biconnectivity augmentation approximate 
weighted matching p| , prize-collecting traveling salesperson JJ, and bounded-degree 
subgraphs which have low weight and small bottleneck cost |lq| . 

A question of general interest is how to obtain improved approximation algorithms 
for such problems when the points come from a Euclidean, as opposed to arbitrary, 
metric space. This requires making use of more than just the triangle inequality. 
Surprisingly, for most problems, improved algorithms are not known. (A notable 
exception is the famous Euclidean Steiner tree problem [|-) We use rudimentary 
geometric techniques to obtain an improved algorithm for the Euclidean degree-if 
spanning tree problem. 

The key to our method is to give short-cutting steps that are provably better than 



implied by the triangle inequality alone. Lemma 3.3, which bounds the perimeter of 
an arbitrary triangle in terms of distances to its vertices from any point, is typical of 
the techniques that we use to get better bounds. 

1.3. Related Work. Papadimitriou and Vazirani showed that any MST whose 
vertices have integer co-ordinates has maximum degree at most five jl5]| . Monma and 
Suri showed that for every set of points in the plane, there exists a degree-5 MST. 

Many recent works have given algorithms to find subgraphs of bounded degree 
that simultaneously satisfy other given constraints. A polynomial-time algorithm to 
find a spanning tree or a Steiner tree of a given subset of vertices in a graph, with 
degree at most one more than minimum was given by Fiirer and Raghavachari 
This was extended to weighted graphs by Fischer |Q . He showed how to find minimum 
spanning trees whose degree is within a constant multiplicative factor plus an additive 
O(logn) of the optimal degree. The degree bound is improved further in the case 
when the number of different edge weights is bounded by a constant. Ravi, Marathe, 
Ravi, Rosenkrantz and Hunt [ |l(| consider the problem of computing bounded-degree 
subgraphs satisfying given connectivity properties in a graph whose edge weights 
satisfy the triangle inequality. They give efficient algorithms for computing subgraphs 
which have low weight and small bottleneck cost. Salowe fill , and Das and Heffernan 
Q consider the problem of computing bounded-degree graph spanners and provide 
algorithms for computing them. Robins and Salowe ftn\ study the maximum degrees 
of minimum spanning trees under various metrics. 
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2. Preliminaries. Let V = {vi, . . . ,v n } be a set of n points in the plane. Let 
G be the complete graph induced by V, where the weight of an edge is the Euclidean 
distance between its endpoints. We use the terms points and vertices interchangeably. 
Let uv be the Euclidean distance between vertices u and v. Let T m j n be a minimum 
spanning tree (MST) of the points in V. Let w(T) denote the total weight of a 
spanning tree T. Let Tk denote a spanning tree in which every vertex has degree at 
most k. Let degr(v) be the degree of a vertex v in the tree T. Let AABC denote 
the triangle formed by points A, B and C. Let LABC denote the angle formed at B 
between line segments AB and BC . Let ABC denote the perimeter of AABC; and 
more generally, let V1V2 ■ ■ ■ Vk denote the perimeter of the polygon formed by the line 
segments ViVi+i for 1 < i < k, where v^+i — v\. 

In this paper we prove the following: for an arbitrary set of points in 

(1) 3T 3 : w(T 3 ) < 1.5 X w(T min ) 

(2) 3Ti : w{T A ) < 1.25 x w(T min ) 

For an arbitrary set of points in 5R d (d > 2), 

(3) 3T 3 : w(T 3 ) < - x w(T min ) 

3. Points in the plane. We first consider the case of 3? 2 - points in the plane. 
We first note some useful properties of minimum spanning trees in 5ft d . 

Proposition 3.1 (p5|). Let AB and BC be two edges incident to a point B in 
a minimum spanning tree of a set of points in #t d . Then LABC is a largest angle in 
AABC. 

Corollary 3.2. Let AB and BC be two edges incident to a point B in a 
minimum spanning tree of a set of points in 5R d . Then 

• LABC > 60° 

• LB AC, LBCA < 90°. 

3.1. An upper bound on the perimeter of a triangle. We now prove an 
upper bound on the perimeter of an arbitrary triangle in terms of distances to its 
vertices from an arbitrary point. This lemma is useful in proving the performances of 
our algorithms. The lemma is also interesting in its own right and we believe that it 
and the associated techniques will be useful in other geometrical problems. 

Lemma 3.3. Let X, A, B, and C be points in di d with ~XA < XB,XC. Then 

(4) ~ABC < (3\/3 — A)XA + 2{XB + X~C). 

Note that 3^/3 — 4 w 1.2. Recall that ABC is the perimeter of the triangle and XY 
is the distance from X to Y . 

Proof. Let B' and C be points on XB and XC respectively such that XA = 
XB' = XC (see Fig. [j]). First we observe that the lemma is true if it is true for the 
points X,A,B' and C. This follows because by the triangle inequality, 

ABC < AB'C + 2BB 7 + 2CC 7 . 

By our assumption, 



AB'C < (3V3 - A)XA + 2{XB> + XC). 
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Fig. 1. Shrinking to obtain canonical form. 

Combining the two inequalities yields the desired result. Therefore in the rest of the 
proof, we show that the lemma is true when the "arms" XA, XB' and XC are equal. 

It is not very difficult to see that to maximize the perimeter of the triangle, X 
will be in the plane defined by A, B' and C, and thus X is at the center of a circle 
passing through A, B' and C . 

By scaling, it suffices to consider the case when the circle has unit radius. In 
this case, the right-hand side of (Q) is exactly 3\/3. Thus, it suffices to show that 
the maximum perimeter achieved by any triangle whose vertices lie on a unit circle is 
3\/3. This is easily proved Q. □ 

Note that in an arbitrary metric space it is possible to have an (equilateral) 
triangle of perimeter six and a point X at distance one from each vertex. 

3.2. Spanning trees of degree three. We now assume that we are given a 
Euclidean minimum spanning tree T of degree at most five. We show how to convert 
T into a tree of degree at most three. The weight of the resulting tree is at most 1.5 
times the weight of T . 

High Level Description: The tree T is rooted at an arbitrary leaf vertex. Since T 
is a degree-5 tree, once it is rooted at a leaf, each vertex has at most four children. 
For each vertex v, the shortest path P v starting at v and visiting every child of v is 
computed. The final tree T3 consists of the union of the paths {P v }- Fig. || gives the 
above algorithm. In analyzing the algorithm, we think of each vertex v as replacing its 
edges from its children with the path P v . The above technique of "shortcutting" the 
children of a vertex by "stringing" them together has been known before, especially 
in the context of computing degree-3 trees in metric spaces (see Jl6|, |l8||). 

Tree-3(V,T) - - Find a degree 3 tree ofV. 

1 Root the MST Tata leaf vertex r. 

2 For each vertex v £ V do 

3 Compute P v , the shortest path starting at v and visiting all the children of v. 

4 Return T3, the tree formed by the union of the paths {Pv}- 

Fig. 2. Algorithm to find a degree 3 tree. 

Note: Typically, the initial MST has very few nodes with degree greater than 
three In practice, it is worth modifying the algorithm to scan the vertices in 
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preorder, maintaining the partial tree T 3 of edges added so far, and to add paths to 
T 3 as follows. When considering a vertex v, if the degree of v in the partial T 3 is two, 
add the path P v as described in the algorithm. Otherwise its degree is one and, in 
this case, relax the requirement that the added path must start at v. That is, add the 



shortest path that visits v and all of v's children to T 3 (see §3.2). This modification 
will never increase the cost of the resulting tree, but may offer substantially lighter 
trees in practice. 

Lemma 3.4. The algorithm in Fig. outputs a spanning tree of degree three. 

Proof. An easy proof by induction shows that the union of the paths forms a tree. 
Each vertex v is on at most two paths and is an interior vertex of at most one path. 
□ 

Lemma 3.5. Let v be a vertex in an MST T of a set of points in Let P v be 
a shortest path visiting {v} U childr(v) with v as one of its endpoints. 

w(P v ) < 1.5 x Wi. 

viechild T (v) 

By the above lemma, each path P v has weight at most 1.5 times the weight of 
the edges it replaces. Thus, 

Theorem 3.6. Let T be a minimum spanning tree of a set of points in 5R 2 . Let 
T 3 be the spanning tree output by the algorithm in Fig. 

w(T 3 ) < 1.5 x w{T). 



Proof of Lemma 3.5. We consider the various cases that arise depending on the 
number of children of v. The cases when v has no children or exactly one child are 
trivial. 

Case 1: v has 2 children, V\,V2- There are two possible paths for P v , namely 
Pi = [v,vi,v 2 ] and P 2 = [v,v 2 ,vi]. Clearly, 

w(P v ) = mm(io(Pi), W[P 2 )) < = — + — + viv 2 < 1.5 {vv\ +vv 2 ). 

Case 2: v has 3 children, v\,v 2 ,v^. Let v\ be the child that is nearest to v. Consider 
the following four paths (see Fig. ||): Pi — [v,v%, V2,v^\,P 2 — [v, vi, V3, v 2 ], P 3 = 
[v,V2,Vi,v 3 ] and P A = [v,v 3 ,V!,v 2 ]. 

The path P v is at most as heavy as the lightest of {P±, P 2 , P 3 , P4}. The weight 
of the lightest of these paths is at most any convex combination of the weights of the 
paths. Specifically, 

MP.) < minMPr), MP 2)) W (P 3 ), < ^ + ^ + ^ + 

3 3 6 6 

We will now prove that 

w(Pi) , w(P 2 ) w{P 3 ) w{P 4 ) 



3 3 6 6 

This simplifies to 



< 1.5 (wi + vv 2 + vv 3 ). 



v\v 2 + v 2 v 3 + v 3 vi < 1.25 vvi + 2(vv 2 + vv 3 ), 
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V 2 Vi 



v 2 v 1 




Fig. 3. T3, three children 



which follows from Lemma 3.3 



Case 3: v has 4 children, v±,V2,V3, i>4, ordered clockwise around v. Let v' be the point 
of intersection of the diagonals V1V3 and V2V4. Note that the diagonals do intersect 



because the polygon V1V2V3V4 is convex (follows from Corollary 3.2) 



Let V3 be the point that is furthest from v', among {vi, V2, V3, V4}. Consider the 
following two paths (see Fig. |J): Pi = [v, v 4 , v 1} v 2 , v 3 ], P 2 — [v, v 2 ,vi,V4, v 3 }. 




V4 



v 2 




v 2 



Clearly, 



Fig. 4. T3, four children 



w(P v ) < min(w(P 1 ),w(P2)) < + 



We will show that 

^(w(P 1 ) + w(P 2 )) < 1.5(vvJ + WD2 +VV3 + Vvt)- 

This simplifies to 

(5) V1V2V3V4 + (vru-2 + Wv£) < 3(otJT + VV3) + 2(vv2 + vvt). 
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We will first prove that 



(6) fiW2«3W4 + (viV2 + V1V4) < 3(v'vi + v'v 3 ) + 2(v'v 2 + v'Vi). 

Once we prove (^|), by the triangle inequality we can conclude that (|5|) is true. 
(Since vv± + W3 > V1V3 — v'vi + v'v^ and W2 + W4 > V2V4 = v'v-i + v'v±.) 

We prove (^|) by contradiction. Suppose there exists a set of points which does 
not satisfy (Q). Suppose we shrink v'v^ by 8. The left side of the above inequality 
decreases by at most 28, whereas the right side of the inequality decreases by exactly 
3<5. Therefore as we shrink v'u 3 , the inequality stays violated. Suppose v'vz shrinks 
and becomes equal to another edge v'vi for some % € {1,2,4}. We now shrink both v'v^ 
and v'vi simultaneously at the same rate. Again it is easy to show that the inequality 
continues to be violated as v'vs and v'vi shrink. Hence we reach a configuration where 
three of the edges are equal. 

Without loss of generality, the length of the three edges is 1 and the length of the 
fourth edge is some e < 1. 

There are two cases to consider. The first is when v'vi — e and the second is 
when v'v2 = e. (The case when v'v^ = e is the same as the second case.) 
Case 3a. v'vi = e. We wish to prove that 

V!V2V 3 V4 + (W[V2~ + Wvt) < 7 + 3e. 

We want to show that the function F{e) — viv^v^vi + (i> 1^2 + wIW) — 7 — 3e 
is non-positive in the domain < e < 1. Simplifying, we get 

F(e) = 2viV2 + V2V3 + V3V4 + 2viV4 — 7 — 3e. 

Each of ViVj in the definition of F is a convex function of e due to the following 
reason. Let p be the point closest to Vj on the line connecting and v' . 
Observe that as Vi moves towards v', ViVj decreases if Vi is moving towards p 
and increases otherwise. Since F is a sum of convex functions minus a linear 
function, it is a convex function of e. Therefore it is maximized at either e = 
or e = 1. 

When e = 1, all four points are at the same distance from v' . If angle 
Lv\v'v\ = a then F can be written as a function of a single variable a and 
it can be verified that F reaches a maximum value of 10\/0^8 — 10, which is 
non-positive. 

When e = 0, viv 2 — V1V4 = 1- Simplifying we get F = V2V3 + — 3, and 
it reaches a maximum value of 2^/2 — 3, which is non-positive (when e = 0, 
note that v\ is the midpoint of the line segment V2V4). 
Case 3b. v'v2 = e. We wish to prove that 

V1V2V3V4 + (vTv2 + WW) < 8 + 2e. 

We want to show that the function F'(e) — tJTWWW + (WW + WW) — 8 — 2e 
is non-positive in the domain < e < 1. 

As a function of e, function F' is a sum of convex functions minus a linear 
function, and thus is convex. Therefore it is maximized at cither e = or 
e = 1. 

The case e = 1 leads to the same configuration as in Case 3a. 

When e = 0, uTW = WW = !■ Here F' — 2vTv4 + TF3V4 — 5. If angle 

Lviv'vi = a, then F' can be written as a function of a single variable a and 
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it can be verified that F' reaches a maximum value of 5V0.8 
non-positive. 



5, which is 



This concludes the proof of Lemma 3.5 



The example in Fig. || shows that the 1.5 factor is tight for the algorithm in Fig. ||, 
modified according to the note following its description. The same example also shows 
that the 1.5 factor is tight for the unmodified algorithm since the unmodified algorithm 
never outputs a lighter tree than the modified algorithm. Each curved arc shown in 
Fig. H is actually a straight line, and has been drawn curved for convenience. The 
vertex that is the child of the root has three children, and is forced to drop one child. 
In doing so, the degree of its child goes to four, and it in turn drops one of its children. 
The algorithm could make choices in such a way that the changes propagate through 
the tree and the tree T3 output by the algorithm may be as shown in the figure. The 
ratio of the cost of the final solution to the cost of the MST can be made arbitrarily 
close to 1.5. See §|| for a discussion on the worst case ratio between degree-3 trees 
and minimum spanning trees. 



Root 




Root 



MST 




Fig. 5. Bad example for algorithm in Fig. 



3.3. Spanning trees of degree four. We now assume that we are given a 
Euclidean minimum spanning tree in which every vertex has degree at most 5. We 
show how to convert this tree to a tree in which every vertex has degree at most 4. 
High Level Description: The basic idea is the same as in the previous algorithm. 

The difference is that we don't insist that each path P„ start at v. The tree is 
rooted at an arbitrary leaf. For each vertex v, the minimum weight path P v visiting 
v and all of v's children (not necessarily starting at v) is computed. The final tree T4 
consists of the union of the paths {P v }- Again, for the analysis we think of each path 
P v replacing the edges between v and its children in T. 

Lemma 3.7. The algorithm in Fig. returns a degree-4 spanning tree of the given 
set of points V. 

Proof. A proof by induction shows that T4 is a tree. Each vertex v occurs in at 
most two paths and thus has degree at most four. □ 
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Tree-4(V, T) - - Find a degree 4 tree of V . 

1 Root the MST T at a leaf vertex r. 

2 For each vertex v £ V do 

3 Compute the shortest path P v visiting v and all its children. 

4 Return T4, the tree formed by the union of the paths {P v }- 

Fig. 6. Algorithm to find a degree 4 tree. 



Lemma 3.8. Let v be a vertex in an MST T for a set of points in -ft 2 . Let P v be 
the shortest path visiting {v} U childrp{v). 

w(P v ) < 1.25 x ^2 vvl. 

viechild T (v) 



From the above lemma, each path P v weighs at most 1.25 times the net weight 
of the edges it replaces. Thus, 

Theorem 3.9. Let T be a minimum spanning tree of a set of points in 5ft 2 . Let 
T4 be the spanning tree output by the algorithm in Fig. |^. 

w(T 4 ) < 1.25 x w(T). 



Proof of Lemma $.c\ The proof is similar to the proof of Lemma |3.5| . As before, we 
consider cases depending on the number of children of v. The cases when v has no 
children, one child, or two children are trivial. 

Case 1: v has 3 children, Vi,V2,V3. Let v\ be the point that is closest to v, among 
its children. Consider the following four paths (see Fig. 0): Pi — [vz, Vi,v,v$\, P2 = 
[v 2 ,v,v 1 ,v 3 ],P 3 = [v 1} v,V2,v 3 ] and P 4 = [v 1 ,v,v 3 ,v 2 ]. 




Fig. 7. T4, three children 



Clearly, 

W (P V )<^ + ^1 + ^1 + ^1. 
3 3 6 6 

We will show that 

w(P t ) w(P 2 ) , w(P 3 ) w(P 4 ) 2 + V3 
^- + ^- + -6- + ^" -3-(t«i+W5 + OTS5). 

9 



This proves the three-child case because 2+ /^ approximately equals 1.244 and is less 
than 1.25. This simplifies to 

V3, 



V\V 2 + V\V 3 + ^2«3 



VVi 



VV 2 



W3 < 2_ 



-(vvi + vv 2 + vv 3 ), 



which further simplifies to 
(7) 



V1V2V3 < 1)oti + (V%+ 7j){w2 +vv 3 ). 



Since V\ is the closest point to v, applying Lemma 3.3, we get 
V1V2V3 < (3\/3 — 4)vv± + 2(vv2 + W3). 

and hence 

V1V2V3 < (Vo — l)vvi + (2vo — 3)vvi 



< (V3- l)wT+ (V3 + ~)(iW2 



2(lW 2 + OT3) 



This proves (@). 

Case 2: i> /ias ^ children, v\, v 2 , 1*3, U4. Assume that i>i is the point that is closest to 
among its children. Let the order of the points be Vi,V2,Vs,V4,, when we scan the 
plane clockwise from v, starting from an arbitrary direction. 

There are two cases, depending on whether V4 or 113 is the point that is furthest 
from v among its children. We first address the case when V4 is the furthest point. 
(The proof for the case when V2 is the point furthest from v is symmetric to the case 
when V4 is the furthest point.) 

Consider the following paths: Pi = [V4, v±, v, V2, v 3 ] and P2 — [V4, v 3 , v, v\, V2] (see 
Fig-!). 





Fig. 8. T4, four children 

The path P v added by the algorithm is at most as heavy as the lighter of the 
paths Pi and P%- Hence 



w(P v ) < min(Pi,P 2 ) < 
10 



w(P 1 )+w(P 2 ) 



We will show that 



w(P 1 )+w(P 2 ) 

Simplifying, we need to show that 
I. . 5, . 

-(V4V1 + V\V + VV 2 + V 2 V 3 + V4V3 + V 3 V + VVi + V1V2) < -^{VVi + VV 2 + VV 3 + VV4). 

Further simplifying, we get: 

1 5 3, . 

viv 2 v 3 V4 < 2 VVl 2 VV4 + 2 2 + W ' 

Note that if it happens that v 3 was the farthest point from v, among its children, 
we get a similar equation with i>3 and V4 being exchanged in r.h.s of the equation. By 
symmetry, the case when 1)2 is furthest is similar to V4 being farthest. 

Without loss of generality, WO3 > TFU2 . The proof now proceeds in a manner similar 



to the proof of Lemma 3.3. If there is a configuration of points for which this equation 
is not true (the l.h.s exceeds the r.h.s) then we can move V4, v 3 closer to v until otJ = 
vv 3 = Wi. In doing this, we decrease the l.h.s by at most 2(vv4 — W2) + 2(vv 3 — vv^). 
Clearly, the r.h.s decreases by exactly 4(vi>4 — WU2) + 4(uu3 — vv 2 ) ■ This ensures that 
the l.h.s is still greater than the r.h.s. Hence without loss of generality, if there is a 
configuration for which our equation is not true then there is a configuration with the 
property that vvl = vv 3 = W2- We now show that when this property is true there 
is no counter-example. 

By scaling, we may assume that WUZ — vv 3 — vv~2 = 1, and vv\ = e, where e < 1. 



Note that (by Corollary 3.2) v was originally within the convex hull of its four 
children. Also (by Corollary 3.2), every child is on the convex hull. These properties 
are both maintained by the above shrinking steps. 

We now wish to prove that 

11 1 

V\V2V 3 V4 < — + -€. 

It is easily shown using elementary calculus that for any e such that v± is on the convex 
hull of the points {v%, . . . , V4}, rotating v% and v 3 around v until Lv\W2 — Lvyuv^ 
(see Fig. ^ and /.V2W3 — IV4W3 does not decrease the perimeter. Also, it maintains 
that V\ is on the convex hull. Assume the two pairs of angles are equal, and define 
F(e) = V1V2V3V4 — e/2 — 11/2. We will show F is non-positive over the domain of 
possible e. 

As a function of e, function F is a sum of convex functions minus a linear function, 
and thus is convex. Therefore, F is maximized either when vv\ = 1 or when v\ is the 
midpoint of edge V2U4 (since v\ is on the convex hull, v\ can not cross the edge, hence 
this interval contains all possible values for e). 

In the first case, all four points lie on a unit circle with center at v. For any four 
such points, it is easily proven using calculus that v\V2V 3 Vi is maximized when the 
four points are the vertices of a square at 4-\/2 ~ 5.66. Thus, -F(l) < 0. 

In the second case, VTU2V3V4 — V2V3W4. As noted previously, this is at most 
3V3 w 5.2. Thus, F(e) < 0. 

We now deal with the case when v 3 is the furthest point. In this case we take 
the paths Pi = [v±, Vi,v, V2,v 3 ] and P 2 — [v 3 , U4, v, Ui,^]. The path P added by the 
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Fig. 9. Figure to illustrate degree four case. 



algorithm is at most as heavy as the lighter of the paths Pi and Pi- Hence, 
w(P)<min(P 1 ,P 2 )< w{Pl)+ 2 W(P2) . 

Simplifying, we get 

1 5 3, , 

ViV 2 V 3 V 4 < -Wx + -VV 3 + -{VV 2 + VV4). 

The proof of this is identical to the proof of the previous case. □ 

4. Points in higher dimensions. We show how to compute a degree-3 tree 
(T3) when the points are in arbitrary dimension d > 3. The algorithm for computing 
the tree is similar to the algorithm for computing degree three trees in the plane — 
the tree T3 is formed by rooting the MST and taking the union of the paths {P v }, 
where each P v is the shortest path starting at v and visiting all of the children of v in 
the rooted MST. It is known that any Euclidean MST has constant degree jlTj (for 
any fixed dimension), so that the algorithm still requires only linear time. The bound 
on the weight of T3 is similar, except that v may have more children. We prove that 
regardless of the number of children that v has, the weight of P v is at most 5/3 the 
weight of the edges that it replaces: 

Lemma 4.1. Let {v,vi,v 2 , ■ ■ ■ , ffc} be a set of arbitrary points in di d . There is a 
path P, starting at v, that visits all the points v\ 1 v 2 , ■ ■ ■ , Vk such that 

5 k 

w(P) < r^Wi. 



Proof. We prove this by induction on the degree of v. Sort the points in increasing 
distance from v as Vx, . . . , Vk- Let v — vq. The lemma is trivially true when fc = 0, 1, 2. 
Let us assume that the lemma is true for all values of k up to some £ > 2. Consider 
k = £ + 1. By the induction hypothesis, the claim is true when v has fc — 3 children; 
hence we can find a path P' that starts at v and visits all vertices Vi (i = 1, . . . , k — 3) 
(not necessarily in that order) such that w(P') < | Stf^W- Let Vj be the last 
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vertex on the path P'. We add the cheapest path P" that starts at Vj and visits 
v k-2i v k-i an d Vk (again, not necessarily in that order). This path together with P' 
will form a path that starts at v and visits all vertices adjacent to v. We now show 
that 

5 

(8) W(P") < -(Wk-2 + VVk-l +VVk). 

This suffices to prove the lemma. Let Pi, . . . , Pq be the six possibilities for P" . Clearly, 

6 

() 



>(p")< Je^)- 



We will prove that 



1 6 5 

Q /s W ( P i) ^ ^( VV k-2 +Wk-1 +VVk) 



i=l 



This simplifies to 

k 

(9) 2 Vk-2Vk-lVk + E - H vv k-2 + Wk-1 + VVk). 

i=k-2 

Notice that if the above equation is not true, we can "shrink" all the Vi (i = 
k — 2, k — 1, k) until vvj = vvj~2 = vvk-i = vvk ■ Assume that 8 — (vvk-2 — vv]) + 
(vvk-i — vvj) + (vvk — Wj ) . This can be done because the r.h.s decreases by 56, and 
the l.h.s decreases by at most 55. If the above equation is not true then it is also not 
true when the distance from v to all the points is the same. By scaling, we can assume 
that the distance of the points from v is 1. We call this a canonical configuration. 



The following proposition is implied by Lillington's work |13 and helps in completing 
the proof. 

Proposition 4.2. Let A, B, C and D be points on a unit sphere in d- dimensions, 
d > 3. The function F = AB + AC + AD + BC + CD + BD reaches a maximum 
value o/4-\/6 when the points A,B,C and D form a regular tetrahedron. 

We will now show that (||) is satisfied by the canonical configuration. The left 
side of (||) can be written as the sum of the sides of the tetrahedron formed by the 
points {ufe, Vfe-i, Vk-2, Vj} and the sum of the sides of the triangle formed by the points 



{vk,Vk-i,Vk-2\- These points lie on a sphere whose center is v. By Lemma 4.2, the 
first sum is bounded by 4^/6. The second sum is bounded by 3^/3- Hence the left side 
of (§) is bounded by 4%/6 + 3%/3, which is about 14.994. The right side of @ is 15. 
Hence (B|) is satisfied by the canonical configuration and therefore all configurations. 



This concludes the proof of Lemma 4.1. □ 

Remark. The algorithm outlined earlier runs in linear time only when d, the number 
of dimensions, is a constant. The algorithm can be modified to run in linear time for 



all d as follows. Observe that in the proof of Lemma 4.1, we considered the neighbors 
of v only three at a time. Therefore the algorithm could also group vertices into sets 
of 3 each, based on the distance from v, and inductively construct the path as in 
the proof of the lemma. This algorithm would have the same performance guarantee 
(5/3) as the earlier algorithm for constructing a degree-3 tree, and in addition have 
the added advantage of running in linear time for all dimensions. 
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5. Conclusions. We have given a simple algorithm for computing a degree-3 
(degree-4) tree for points in the plane that is within 1.5 (1-25) of an MST of the 
points. An extension of the algorithm finds a degree-3 tree of an arbitrary set of 
points in d-dimensions within 5/3 of an MST. If an MST of the points is given as part 
of the input, our algorithms run in linear time. All our proofs are based on elementary 
geometric techniques. 

Though our algorithms improve greatly the best known ratios for each of the 
respective problems, there are still large gaps between the ratios that we obtain and 
the best bounds that we think are achievable. For example, in the case of points in the 
plane, consider the ratio of the weight of a minimum weight degree-3 tree to the weight 
of an MST. The worst example that we can obtain for this ratio is v/ ^ +3 « 1.104 (with 
5 points, where 4 of the points are at the corners of a square and the fifth point is 
in the middle). There is a large gap between this and the ratio of 1.5 obtained by 
our algorithm. Is 1.104 the worst case ratio? Are there polynomial time algorithms 
which obtain factors better than 1.5? Notice that the performance ratio obtained by 
our algorithm on the example in Fig. |^ is highly sensitive to the vertex chosen as the 
root. One potential algorithm is to simply try all possible vertices as the root, and to 
pick the tree of minimum weight. Does such an algorithm have a better performance 
guarantee? 

For the problem of finding degree-4 trees, our algorithm obtains a ratio of 1.25. 
Unlike degree-3 trees, we are unable to show that this ratio is tight for the algorithm. 
Can the factor of 1.25 for the algorithm be improved? The worst example for the 
ratio between a minimum-weight degree-4 tree and an MST that we can obtain is 
about 1.035 (5 points on the vertices of a regular pentagon with a sixth point in their 
centroid). Are there examples with worse ratios? 

Problems of approximating degree-A; trees in higher dimensions and in general 
metric spaces within factors better than 2 are still open. 
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